The Hash Postfix Table Based Metadata Management Algorithm for Mass Storage System

Authors

  • Xiujuan Li
  • Tao Cai
  • Shiguang Ju
Abstract

For a mass storage system, it is important to distribute access requests dynamically and evenly across several metadata servers (MDS). Based on an analysis of the features of metadata management in mass storage systems, this paper designs the structure of a metadata management module and proposes a hash postfix table (HPT) based metadata management algorithm, which uses the HPT to adjust the distribution of access requests dynamically. It presents the processes of metadata querying and quick HPT adjustment, and analyzes the algorithm in terms of its dynamic balancing capability and its time and space overhead. Finally, a prototype is implemented and evaluated using real data sets. The results show that the HPT-based metadata management algorithm is more effective and flexible, and can avoid hotspots.
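The abstract does not spell out how the HPT is built, so the following is only a rough sketch of one possible reading, not the authors' algorithm. It assumes that the "postfix" is the low-order bits of a hash of the file path, that the table maps each postfix value to an MDS identifier, and that rebalancing means reassigning a hot postfix slot to a less loaded server. The class name `HashPostfixTable`, its methods, the choice of MD5, and the rebalancing rule are all hypothetical.

```python
import hashlib
from collections import Counter


class HashPostfixTable:
    """Hypothetical sketch: maps the low-order bits of a path hash to an MDS id."""

    def __init__(self, num_mds: int, postfix_bits: int = 8):
        self.num_mds = num_mds
        self.postfix_bits = postfix_bits
        # Assumption: slots start out spread round-robin over the servers.
        self.table = [slot % num_mds for slot in range(1 << postfix_bits)]

    def postfix(self, path: str) -> int:
        """Return the low-order `postfix_bits` bits of the path's hash."""
        digest = hashlib.md5(path.encode("utf-8")).digest()
        value = int.from_bytes(digest, "big")
        return value & ((1 << self.postfix_bits) - 1)

    def lookup(self, path: str) -> int:
        """Metadata query: find the MDS currently responsible for `path`."""
        return self.table[self.postfix(path)]

    def rebalance(self, slot_load: Counter) -> None:
        """Move the hottest slot of the busiest MDS to the least loaded MDS.

        `slot_load` maps a postfix slot to its observed request count.
        This single-slot rule is an illustrative assumption only.
        """
        mds_load = Counter({mds: 0 for mds in range(self.num_mds)})
        for slot, load in slot_load.items():
            mds_load[self.table[slot]] += load
        busiest = max(mds_load, key=mds_load.get)
        idlest = min(mds_load, key=mds_load.get)
        if busiest == idlest:
            return
        hot_slots = [s for s in slot_load if self.table[s] == busiest]
        if not hot_slots:
            return
        hottest = max(hot_slots, key=lambda s: slot_load[s])
        self.table[hottest] = idlest
```

Under these assumptions, a lookup costs one hash plus one table access, the table occupies `2**postfix_bits` entries, and an adjustment only rewrites a single entry, which is one way the time and space overhead mentioned in the abstract could stay small.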

Similar resources

FusionProv: Towards a Provenance-Aware Distributed Filesystem

It has become increasingly important to capture and understand the origins and derivation of data (its provenance). A key issue in evaluating the feasibility of data provenance is its performance, overheads, and scalability. In this paper, we explore the feasibility of a management layer for parallel file systems, in which metadata includes both file operations and provenance metadata. We desig...

Scalable Storage for Data-Intensive Computing

Cloud computing applications require a scalable, elastic and fault tolerant storage system. We survey how storage systems have evolved from the traditional distributed filesystems, peer-to-peer storage systems and how these ideas have been synthesized in current cloud computing storage systems. Then, we describe how metadata management can be improved for a file system built to support large sc...

Research of Data Storage and Querying Methods Based on Ring Distributed Hash

In this paper, the main contributions of this work include three aspects. First, the deployment on different datacenters of Impala which is a database based on Ring Distributed Hash. This thesis deploys Impala system on different datacenters across WAN or across regions. Second, the research of data storage and search method based on circular distributed hash. This thesis adopts distributed has...

DDSF: A Data Deduplication System Framework for Cloud Environments

Cloud storage has been widely used because it can provide seemingly unlimited storage space and flexible access way, while the rising cost of storage and communications is an issue. In this paper, we propose a Data Deduplication System Framework(DDSF) for cloud storage environments. The DDSF consists of three major components, the client, fingerprint server and storage component. The client com...

Enabling High Data Throughput in Desktop Grids through Decentralized Data and Metadata Management: The BlobSeer Approach

Whereas traditional Desktop Grids rely on centralized servers for data management, some recent progress has been made to enable distributed, large input data, using peer-to-peer (P2P) protocols and Content Distribution Networks (CDN). We make a step further and propose a generic, yet efficient data storage which enables the use of Desktop Grids for applications with high output data requirem...

Journal title:
  • JDCTA

Volume 4  Issue 

Pages  -

Publication date 2010